AITopics | expert score

Collaborating Authors

expert score

Information about AI from the News, Publications, and Conferences

Automatic Classification – Tagging and Summarization – Customizable Filtering and Analysis

If you are looking for an answer to the question What is Artificial Intelligence? and you only have a minute, then here's the definition the Association for the Advancement of Artificial Intelligence offers on its home page: "the scientific understanding of the mechanisms underlying thought and intelligent behavior and their embodiment in machines."

However, if you are fortunate enough to have more than a minute, then please get ready to embark upon an exciting journey exploring AI (but beware, it could last a lifetime) …

b8f10193cab43d45df9bb810637333fd-Paper-Conference.pdf

Neural Information Processing SystemsApr-30-2026, 01:32:59 GMT

large language model, machine learning, sparsity, (20 more...)

Neural Information Processing Systems

Country:

North America > United States (1.00)
Europe (0.93)

Genre:

Research Report > New Finding (1.00)
Research Report > Experimental Study (0.93)

Industry: Information Technology (0.46)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)
Information Technology > Communications (0.98)
(2 more...)

Add feedback

b8f10193cab43d45df9bb810637333fd-Paper-Conference.pdf

Neural Information Processing SystemsFeb-17-2026, 17:47:42 GMT

large language model, machine learning, sparsity, (20 more...)

Neural Information Processing Systems

Country:

North America > United States > Washington > King County > Seattle (0.04)
North America > United States > Pennsylvania > Allegheny County > Pittsburgh (0.04)
North America > United States > Michigan (0.04)
(5 more...)

Genre:

Research Report > New Finding (1.00)
Research Report > Experimental Study (0.93)

Industry: Information Technology (0.46)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)
Information Technology > Communications (0.98)
(2 more...)

Add feedback

Engadget's best of 2025

EngadgetDec-15-2025, 12:00:00 GMT

Engadget has been reviewing the latest devices for over two decades, adding well over 100 in-depth product tests to our tally every year. For 2025, we have compiled a list of the best gear we reviewed this year based on the highest review scores in each category. From Pixel to iPad, and Switch 2 to Sony WH-1000XM6, our reviews team has spent thousands of hours testing new products this year to discover the best of the best. Now it's your turn to rediscover the best gadgets of 2025, including explanations from our editors as to why these products were rated so highly.

amazon explore, expert score, laptop, (12 more...)

Engadget

Country: North America > United States (0.04)

Industry:

Leisure & Entertainment (1.00)
Information Technology (1.00)
Semiconductors & Electronics (0.92)
(3 more...)

Technology:

Information Technology > Hardware (1.00)
Information Technology > Communications > Mobile (1.00)
Information Technology > Artificial Intelligence (1.00)

Add feedback

Domain-Specific Pruning of Large Mixture-of-Experts Models with Few-shot Demonstrations

Dong, Zican, Peng, Han, Liu, Peiyu, Zhao, Wayne Xin, Wu, Dong, Xiao, Feng, Wang, Zhifeng

arXiv.org Artificial IntelligenceMay-29-2025

Mixture-of-Experts (MoE) models achieve a favorable trade-off between performance and inference efficiency by activating only a subset of experts. However, the memory overhead of storing all experts remains a major limitation, especially in large-scale MoE models such as DeepSeek-R1(671B). In this study, we investigate domain specialization and expert redundancy in large-scale MoE models and uncover a consistent behavior we term few-shot expert localization, with only a few in-domain demonstrations, the model consistently activates a sparse and stable subset of experts on tasks within the same domain. Building on this observation, we propose a simple yet effective pruning framework, EASY-EP, that leverages a few domain-specific demonstrations to identify and retain only the most relevant experts. EASY-EP comprises two key components: output-aware expert importance assessment and expert-level token contribution estimation. The former evaluates the importance of each expert for the current token by considering the gating scores and L2 norm of the outputs of activated experts, while the latter assesses the contribution of tokens based on representation similarities before and after routed experts. Experiments on DeepSeek-R1 and DeepSeek-V3-0324 show that our method can achieve comparable performances and $2.99\times$ throughput under the same memory budget with full model with only half the experts.

artificial intelligence, machine learning, natural language, (21 more...)

arXiv.org Artificial Intelligence

2504.06792

Country: Europe > Austria > Vienna (0.14)

Genre: Research Report > New Finding (1.00)

Technology:

Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.92)

Add feedback

Pencils to Pixels: A Systematic Study of Creative Drawings across Children, Adults and AI

Nath, Surabhi S, Schröder, Guiomar del Cuvillo y, Stevenson, Claire E.

arXiv.org Artificial IntelligenceFeb-9-2025

Can we derive computational metrics to quantify visual creativity in drawings across intelligent agents, while accounting for inherent differences in technical skill and style? To answer this, we curate a novel dataset consisting of 1338 drawings by children, adults and AI on a creative drawing task. We characterize two aspects of the drawings -- (1) style and (2) content. For style, we define measures of ink density, ink distribution and number of elements. For content, we use expert-annotated categories to study conceptual diversity, and image and text embeddings to compute distance measures. We compare the style, content and creativity of children, adults and AI drawings and build simple models to predict expert and automated creativity scores. We find significant differences in style and content in the groups -- children's drawings had more components, AI drawings had greater ink density, and adult drawings revealed maximum conceptual diversity. Notably, we highlight a misalignment between creativity judgments obtained through expert and automated ratings and discuss its implications. Through these efforts, our work provides, to the best of our knowledge, the first framework for studying human and artificial creativity beyond the textual modality, and attempts to arrive at the domain-agnostic principles underlying creativity. Our data and scripts are available on GitHub.

large language model, machine learning, natural language, (22 more...)

arXiv.org Artificial Intelligence

2502.05999

Country:

Europe > Germany > Baden-Württemberg > Tübingen Region > Tübingen (0.05)
Europe > Netherlands > North Holland > Amsterdam (0.04)
Europe > Germany > Saxony > Leipzig (0.04)

Genre: Research Report (1.00)

Industry: Education > Educational Setting (0.46)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Cognitive Science (0.94)
Information Technology > Artificial Intelligence > Vision (0.68)
(2 more...)

Add feedback

ReXamine-Global: A Framework for Uncovering Inconsistencies in Radiology Report Generation Metrics

Banerjee, Oishi, Saenz, Agustina, Wu, Kay, Clements, Warren, Zia, Adil, Buensalido, Dominic, Kavnoudias, Helen, Abi-Ghanem, Alain S., Ghawi, Nour El, Luna, Cibele, Castillo, Patricia, Al-Surimi, Khaled, Daghistani, Rayyan A., Chen, Yuh-Min, Chao, Heng-sheng, Heiliger, Lars, Kim, Moon, Haubold, Johannes, Jonske, Frederic, Rajpurkar, Pranav

arXiv.org Artificial IntelligenceAug-28-2024

Given the rapidly expanding capabilities of generative AI models for radiology, there is a need for robust metrics that can accurately measure the quality of AI-generated radiology reports across diverse hospitals. We develop ReXamine-Global, a LLM-powered, multi-site framework that tests metrics across different writing styles and patient populations, exposing gaps in their generalization. First, our method tests whether a metric is undesirably sensitive to reporting style, providing different scores depending on whether AI-generated reports are stylistically similar to ground-truth reports or not. Second, our method measures whether a metric reliably agrees with experts, or whether metric and expert scores of AI-generated report quality diverge for some sites. Using 240 reports from 6 hospitals around the world, we apply ReXamine-Global to 7 established report evaluation metrics and uncover serious gaps in their generalizability. Developers can apply ReXamine-Global when designing new report evaluation metrics, ensuring their robustness across sites. Additionally, our analysis of existing metrics can guide users of those metrics towards evaluation procedures that work reliably at their sites of interest.

ground-truth report, metric, rexamine-global, (17 more...)

arXiv.org Artificial Intelligence

2408.16208

Country:

Europe > Germany > North Rhine-Westphalia (0.05)
North America > United States > Florida > Miami-Dade County > Miami (0.04)
Asia > Taiwan > Taiwan Province > Taipei (0.04)
(7 more...)

Genre:

Research Report > New Finding (0.68)
Research Report > Experimental Study (0.47)

Industry:

Health & Medicine > Nuclear Medicine (1.00)
Health & Medicine > Diagnostic Medicine > Imaging (1.00)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.53)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.38)

Add feedback

Generative Debunking of Climate Misinformation

Zanartu, Francisco, Otmakhova, Yulia, Cook, John, Frermann, Lea

arXiv.org Artificial IntelligenceJul-8-2024

Misinformation about climate change causes numerous negative impacts, necessitating corrective responses. Psychological research has offered various strategies for reducing the influence of climate misinformation, such as the fact-myth-fallacy-fact-structure. However, practically implementing corrective interventions at scale represents a challenge. Automatic detection and correction of misinformation offers a solution to the misinformation problem. This study documents the development of large language models that accept as input a climate myth and produce a debunking that adheres to the fact-myth-fallacy-fact (``truth sandwich'') structure, by incorporating contrarian claim classification and fallacy detection into an LLM prompting framework. We combine open (Mixtral, Palm2) and proprietary (GPT-4) LLMs with prompting strategies of varying complexity. Experiments reveal promising performance of GPT-4 and Mixtral if combined with structured prompts. We identify specific challenges of debunking generation and human evaluation, and map out avenues for future work. We release a dataset of high-quality truth-sandwich debunkings, source code and a demo of the debunking system.

annotator, fallacy, misinformation, (14 more...)

arXiv.org Artificial Intelligence

2407.05599

Country:

Asia > Middle East > UAE > Abu Dhabi Emirate > Abu Dhabi (0.04)
Oceania > Australia > Victoria > Melbourne (0.04)
North America > United States > Maryland > Montgomery County > Gaithersburg (0.04)
(3 more...)

Genre: Research Report > New Finding (0.68)

Industry: Media > News (1.00)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.69)

Add feedback

Jack of All Trades, Master of Some, a Multi-Purpose Transformer Agent

Gallouédec, Quentin, Beeching, Edward, Romac, Clément, Dellandréa, Emmanuel

arXiv.org Artificial IntelligenceFeb-15-2024

The search for a general model that can operate seamlessly across multiple domains remains a key goal in machine learning research. The prevailing methodology in Reinforcement Learning (RL) typically limits models to a single task within a unimodal framework, a limitation that contrasts with the broader vision of a versatile, multi-domain model. In this paper, we present Jack of All Trades (JAT), a transformer-based model with a unique design optimized for handling sequential decision-making tasks and multimodal data types. The JAT model demonstrates its robust capabilities and versatility by achieving strong performance on very different RL benchmarks, along with promising results on Computer Vision (CV) and Natural Language Processing (NLP) tasks, all using a single set of weights. The JAT model marks a significant step towards more general, cross-domain AI model design, and notably, it is the first model of its kind to be fully open-sourced (see https://huggingface.co/jat-project/jat), including a pioneering general-purpose dataset.

agent, dataset, transformer, (15 more...)

arXiv.org Artificial Intelligence

2402.09844

Country:

North America > United States > California > San Francisco County > San Francisco (0.14)
North America > United States > Louisiana > Orleans Parish > New Orleans (0.04)
North America > United States > California > Los Angeles County > Long Beach (0.04)
(12 more...)

Genre: Research Report (1.00)

Industry:

Leisure & Entertainment > Sports (0.68)
Leisure & Entertainment > Games (0.46)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (0.93)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.87)
Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (0.87)

Add feedback

Learn To be Efficient: Build Structured Sparsity in Large Language Models

Zheng, Haizhong, Bai, Xiaoyan, Chen, Beidi, Lai, Fan, Prakash, Atul

arXiv.org Artificial IntelligenceFeb-8-2024

Large Language Models (LLMs) have achieved remarkable success with their billion-level parameters, yet they incur high inference overheads. The emergence of activation sparsity in LLMs provides a natural approach to reduce this cost by involving only parts of the parameters for inference. Existing methods only focus on utilizing this naturally formed activation sparsity, overlooking the potential for further amplifying this inherent sparsity. In this paper, we hypothesize that LLMs can learn to be efficient by achieving more structured activation sparsity.To achieve this, we introduce a novel algorithm, Learn-To-be-Efficient (LTE), designed to train efficiency-aware LLMs to learn to activate fewer neurons and achieve a better trade-off between sparsity and performance. Furthermore, unlike SOTA MoEfication methods, which mainly focus on ReLU-based models, LTE can also be applied to LLMs like GPT and LLaMA with soft activation functions. We evaluate LTE on four models and eleven datasets. The experiments show that LTE achieves a better trade-off between sparsity and task performance. For instance, LTE with LLaMA provides a 1.83x-2.59x FLOPs speed-up on language generation tasks, outperforming the state-of-the-art methods.

dataset, neuron, sparsity, (13 more...)

arXiv.org Artificial Intelligence

2402.06126

Country:

North America > United States > Washington > King County > Seattle (0.04)
North America > United States > Michigan (0.04)
North America > United States > Illinois > Champaign County > Urbana (0.04)
(4 more...)

Genre: Research Report > Promising Solution (0.34)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.69)

Add feedback